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AMPLIFIED SEQUENCE POLYMORPHISMS (ASPs) 

This application is a continuation-in-part of 
application Serial No. 187,428 filed April 28, 1988 entitled 
"Gene Mapping and Genotyping With PCR Polymorphisms" and 
incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

The current initiative to map and sequence the entire 
human genome portends a detailed molecular understanding of 
all human genetic diseases. This initiative is derived in 
part from the use of restriction fragment length 
polymorphisms (RFLPs) as genetic markers for mapping the 
human genome. 1/ RFLPs are found by screening cloned DNA 
sequences for their ability to reveal sequence 
polymorphisms. Restriction enzyme digestion of genomic DNA 
creates fragment lengths which differ according to the 
presence or absence of a given restriction site. The 
fragments are separated by length on a gel, blotted, and 
hybridized with labeled cloned DNA to display the fragment 
length polymorphism for homologous fragments. Multiple 
consecutive hybridizations of such probes to "reusable" 
Southern blots containing the DNA from appropriate pedigrees 
are required to map disease loci. 

A number of strategies to increase the productivity of 
gene mapping and genotyping with RFLPs have been developed. 

Y/ Botstein, D. , et al . , Construction of a genetic 
linkage map in man using restriction fragment length 
polymorphisms, Amer. J. Hum. Genet. 32:314-331 (1980). 
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Somatic cell hybrids and sorted chromosomes have been used 
to isolate markers on specific chromosomes* 1/ The Mspl 
and TagI restriction enzymes reveal RFLPs with a higher 
frequency than other restriction enzymes because their 
recognition sequence contains the highly mutable CpG 
dimer.l/ Highly polymorphic RFLPs are found in 
restriction fragments which span tandem repetitive 
sequences, such as those associated with the insulin 
gene!/ and the H-ras gene.l/ Jeffreys et al.iL/ discovered 
a series of polymorphisms by screening with a sequence 
homologous to many tandem repeat sequences. 

2/ Gusella, J.F., et al., Isolation and localization of 
DNA segments from specific human chromosomes, Proc . Natl . 
Acad. Set,., USA 77:2829-2833 (1980); Davies, K.E., et al. , 
Cloning of a representative genomic library of the human X 
chromosome after sorting by flow cytometry , Nature 
(London) 293 ; 374-376 (1981). 

3/ Barker, D . , et al. , Restriction sites containing CpG 
show a higher frequency of polymorphisms in human DNA, 
Cell 36:131-138 (1984). 

4/ Bell, G.I., et al., The highly polymorphic region near 
the human insulin gene is composed of simple tandemly 
repeating sequences, Nature f London) 295 :31-35 (1982). 

5/ Capon, D.J., et al., Complete nucleotide sequences of 
the T24 human bladder carcinoma oncogene and its normal 
homologue, Nature (London) 302 :33-37 (1983). 

6/ Jeffreys, A.J., et al, Hypervariable "minisatellite" 
regions in human DNA, Nature (London) 314 ; 67-73 (1985) ; 
Jeffreys, A. J. , et al., Individual-specific "fingerprints" 
of human DNA, Nature (London) 316 :76-79 (1985). 
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Oligonucleotide hybridization is an important 
technology in the analysis of sequence differences.!/ 
Nakamura et al.iL/ combined the Jeffreys and Wallace 
technologies to create an efficient strategy for isolating 
a large number of markers whose polymorphism is based on a 
variable number of tandem repeats (VNTRs) . 

Despite this significant progress and the creation of 
the lymphoblastoid cell bank of the CEPH reference 
families for gene mapping,!/ much work remains for all of 
the markers to be well mapped with respect to one another 
and for the more than 3000 genetic diseases^/ to be 
mapped with respect to the markers. A more efficient 

77 Wallace, R.B. et al., Hybridization of synthetic 
oligodeoxyribonucleotides to ^X174 DNA: The effect of 
single base pair mismatch, Nucleic Ac ids Res. 6:3543-3557 
(1979); Conner, B.J., et al., Detection of sickle cell 
^s_giobin allele by hybridization with synthetic 
oligonucleotides, Proc. Natl. Acad. Sci. USA 80:278-282 
(1983) . 

8/ Nakamura, Y . , et al. , Variable number tandem repeat 
(VNTR) markers for human gene mapping, Science 
235:1616-1622 (1987). 

9/ Dausset, J., Le Centre d' etude du polymorphisme 
huroain, Presse Med. 15:1801-1802 (1986). 

10/ McKusick, V.A., Mendelian Inheritance in Man: 
Catalogs of Autosomal Dominant, Autosomal Recessive, and 
X-Linked Phenotypes, 7th ed. John Hopkins Univ. Press, 
Baltimore (1986) . 
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means of mapping new disease and marker loci is needed • 
This mapping will lead to the disease causing gene. Once 
the defective gene is isolated, a number of abnormal 
genotypes are expected to be found. This invention 
comprises a systematic approach to gene mapping or 
genotyping based on the analysis, which may be 
simultaneous, of amplified sequence polymorphisms (ASPs) . 

Recently, Saiki, Mullis, and collaborators have 
demonstrated the specific amplification of short genomic 
DNA sequences by incubating DNA, DNA polymerase, and 
flanking oligonucleotide primers complementary to opposite 
strands of the DNA followed by sequential rounds of DNA 
denaturation and polymerization. 11/ The efficiency of 
each round of . polymerization approaches 100% producing an 
amplification after n rounds of nearly 2 n . This 
technique, known as polymerase chain reaction (PCR) , has 
been used to detect the sickle cell mutation, 11/ 

11/ Saiki, R.D., et al., Enzymatic amplification of 
/9-globin genomic sequences and restriction site analysis 
for diagnosis of sickle cell anemia, Science 230 ; 1350-1354 
(1985); Mullis, K. , et al., Specific enzymatic 
amplification of DNA in vitro: The polymerase chain 
reaction, Cold Spring Harbor Symp. Quant. Biol. 51:263-273 



12/ Saiki, R. D. , supra , n. 11; Chehab, F.F., et al., 
Detection of sickle cell anemia and thalassaemias, Natur< 
(London) 329 :293-294 (1987); Embury, S.H., et al., Rapid 
prenatal diagnosis of sickle cell anemia by a new method 
of DNA analysis, N. Engl. J» Ned. 316:656-661 (1987). 



(1986) . 
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0-thalassemia mutations, 12/ and H-ras mutations.il/ 
Higuchi, et al.15/ haV e shown the feasibility of using PCR 
to amplify DNA obtained from a single hair to detect 
polymorphisms in mitochondrial DNA and HLA class II 
genes. These studies have focused on the ability of PCR 
to amplify the amount of DNA in the sample. 

Allele specific PCR (AS-PCR) is described in Wallace, 
et al. application Serial No. 283,142 filed December 12, 
1988 and incorporated herein by reference. 

Another enzymatic amplification strategy using DNA 
ligase is described in Wallace application Serial No. 
178,377 filed April 6, 1988 and incorporated herein by 
reference. 

In the ligation amplification reaction (LAR) , two 
pairs of oligonucleotides are ligated. One pair is 
complementary to adjacent nucleotides on the upper strand 
and the other pair is complementary to adjacent 

13/ wong, C. , et al., Characterization of 0-thalassaemia 
mutations using direct genomic sequencing of amplified 
single copy DNA, Nature (London) 330:384-386 (1987). 

14/ Farr, C. J. , et al., Analysis of RAS gene mutations in 
acute myeloid leukemia by polymerase chain reaction and 
oligonucleotide probes, Proc. Natl. Acad . Sci. USA 
85:1629-1633 (1988). 

15/ Higuchi, R., et al., DNA typing from single- hairs, 
Nature (London) 332 :543-546 (1988). 
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nucleotides of the lower strand of the target DNA. The 
amplification product accumulates exponentially because 
the products of each round of ligation serve as templates 
for subsequent rounds. The LAR is favored by perfect base 
pairing at the ligation site. Under appropriate 
conditions in the presence of sequence variation, the 
reaction proceeds for one allele but not for others . 

Definitions 

For the purposes of this application, the following 
definitions apply. 

Polynucleotide — A nucleic acid molecule composed of 
more than one nucleotide, e.g. , RNA or DNA. 

Amplified Sequence Polymorphism (ASP) — The product of 
amplification as by the PCR or the LAR of a polymorphic 
target polynucleotide sequence having a preselected locus 
specific characteristic (LSC), e.g., restriction fragment 
length, allele specif ic hybridization capability, gel 
mobility or a colorimetric or isotopic label. 

Marker — The preselected LSC of an ASP. 

Locus Specific Fragment (LSF) — A fragment of an 
amplified polynucleotide having or including an LSC. 

Class I ASP— An ASP in which the LSC is the presence 
or absence of a. restriction endonuclease site. 



WO 89/10414 



PCT/US89/01807 



-8- 

The length of amplified target sequences or of LSFs is 
determined by choice and position of primers used for PCR 
or for allele specific PCR (AS-PCR) . The presence or 
absence of alleles at each locus may be determined by 
allele specific primer extension (AS-PE) using primers of 
predetermined length. The length of an AS-PE product is 
characteristic of the locus, and the presence or absence 
of such product is characteristic of the allele of the 
locus. Preferably, multiple target sequences are 
simultaneously amplified, and the amplification products 
simultaneously analyzed. 

The invention also includes means for predetermining 
appropriate PCR primer lengths and a rapid and simple 
method for preparing DNA for ASP analysis. 

Target Sequence Amplification Primers 

PCR and LAR amplification primers may be selected or 

f 

developed either from published information or by 
sequencing the DNA segments flanking a particular 
polymorphism. 

This aspect of the invention importantly involves the 
determination of a temperature at which a primer of a 
given length will prime for a template base of any base 
composition. This determination is based upon a- 
relationship between the stability of the duplex of an 
oligonucleotide primer and a template and the 
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Class II ASP — An ASP in which LSC is a preselected 
nucleotide deletion or insertion variation in a DNA target 
sequence or a restriction fragment length polymorphism 
based on the presence of VNTRs. 

Class III ASP — An ASP in which the LSC is a base pair 
substitution or a small consistent nucleotide deletion not 
creating an RFLP. 

Summary of the Invention 
ASP technology provides amplified polynucleotide 
target sequences containing the variation of a polymorphic 
locus. The amplified target sequences may be 
distinguished inter se by a unique physical property, thus 
facilitating simultaneous, automatic analysis for multiple 
loci. A preferred physical property is the length or 
number of nucleotides in the entire amplified target 
sequence producing an LSF. 

The spacing between DNA sequences on a gel is a 
function of sequence size. ASP technology permits 
preselection of sequence size with the result that such 
gel spacing can be maintained substantially constant. In 
addition, preselected sequence size differences can be 
resolved on the basis of electrophoresis time over a fixed 
distance rather than by location on a gel. 
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oligonuclebtide's ability to prime in a PCR at a given 
polymerization temperature. This consideration is 
important because the temperature of polymerization and 
annealing when PCR is done with a thermostable enzyme, 
e.g., Therrous aauaticus (Tag), may exceed the temperature 
at which the primer is stably hybridized to the template. 

It is reported that in a primer-template duplex the 
stability contributed by a G:C base pair is twice that 
contributed by an A:T base pair. IV On that assumption, a 
normalized length, Ln, can be determined by the equation 
^(number of A and T) + 2 (number of G and C). Based on 
a systematic study of the temperature at which various 
primer-template combinations yield a successful PCR 
product as well as temperatures at which no amplification 
is observed, a priming temperature at which a primer with 
a certain length and base composition can be determined. 
For example, Figure 1 is a curve showing the maximum 
temperature at which a given primer yielded an 
amplification product as a function of the normalized 

167 Suggs, et al. in "Developmental Biology using 
Purified Genes, ICN-UCIA Symposia on Molecular and 
Cellular Biology," D.D. Brown and C.F. Fox (eds.). Use of 
synthetic oligonucleotides for the isolation of specific 
cloned DNA sequences. Academic Press , New York, Vol. 23, 
No. 54, pp. 683-693 (1981). 
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length (0) as well as a curve showing the minimum 
temperature at which no amplification was seen for a given 
primer as a function of normalized length (•■). 

The Figure 1 curves reflect data utilizing the 
primer?- template sets set forth in Table I. 
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AS-PCR 

AS-PCR provides direct determination of genotype by 
the presence or absence of a specific amplified sequence. 
Two allele specific oligonucleotide primers, one specific 
for a variant allele and one specific for a normal allele, 
together with another primer complementary to both 
alleles, are used in the PCR with genomic DNA templates. 
The allele specific primers differ from each other, for 
example, in their terminal 3' nucleotide. Under 
appropriate primer annealing temperature and PCR 
conditions, these primers only direct amplification on 
their complementary allele. 

An important aspect of this invention therefore is the 
use of AS-PCR to produce ASPs because a single step yields 
not only a locus but also an LSF. Simultaneous analysis 
of multiple loci is facilitated. For example, with 10 
sets of primers, 10 individual PCRs may be conducted or 
the 10 PCRs may be conducted simultaneously. The size of 
the 10 locus specific sequences in the amplification 
product may be preset in the range of 100 to 500 base 
pairs. If the length (L) of the LSF for the ith locus is 
related to the length of locus i+l as follows 

log (L(i +1 )=log Li + 0.078 (for 10 loci 100-500 bp) 
then the LSAs will be equidistant from each other when 
resolved on a gel. 
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Figure 2 illustrates a hypothetical result. PI and P2 
referenced in Figure 2 are the primer sets specific for 
allele 1 and allele 2 of the particular polymorphic 
locus. Ten loci each with a different length between 100 
and 500 base pairs are shown. The hypothetical individual 
is heterozygous for loci 1, 3, 6, 8 and 10. 

AS-PE 

Another important aspect of the invention involves 
allele specific primer extension (AS-PE) to detect allele 
variation in PCR amplified target sequences. A DNA 
polymerase, e.g. Tag polymerase, is used in a primer 
extension reaction to test for the presence or absence of 
a particular nucleotide involved in a polymorphism. The 
following description elucidates this aspect of the 
invention utilizing sickle cell anemia as a model for 
single nucleotide polymorphism. 

As shown in Figure 3, ASP-A is a 19-nucleotide long 
primer that is complementary to the antisense strand of 
the /9-globin gene. The primer anneals to the 0-globin 
gene sequence immediately 3' and adjacent to the 
nucleotide involved in the A->T transversion mutation of 
the sickle cell allele. In a primer extension reaction in 
which only a single cr-[ 3 ?P]dNTP is added, the £ A allele 
will direct the incorporation of dATP and the p s allele 
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will direct the incorporation of dTTP. Thus, the primer 
extension reaction gives rise to a labeled oligonucleotide 
one nucleotide longer than the primer and in an allele 
specific way. 

The length of the oligonucleotide used in the primer 
extension reaction can be varied, either by making it 
complementary to more nucleotides of the template or, 
preferably, by maintaining the extent of template 
complementarity and adding additional, non-template 
complementary nucleotides to the 5 ' end of the primer. In 
the latter case, multiple AS-PE reactions can be analyzed 
on a single gel. Multiple, independent AS-PE extensions 
can be accomplished in a single reaction. 

In a typical AS-PE experiment, 3 pi of a 25 cycle, PCR 
amplified £-globin DNA sample that has been gel purified 
to remove the PCR primers was used as the template for the 
single nucleotide extension of the ASP-A primer. AS-PE 
reactions were performed with 0.5 pM ASP-A in the presence 
of 50 mM KC1, 10 mM Tris-HCl (pH 8-3), 1.5 mM MgCl 2 , 0.01% 
(w/v) gelatin, 2.5 units of Tag polymerase, and either 3 
pM of [a- 32 P]dATP or [a- 32 P]TTP (300 Ci/mmol) . 10 pi of 
mineral oil was layered on top of the reaction mixture (10 
pi total volume) to prevent evaporation. The sample was 
then heated for 5 min at 95 °C and allowed to incubate at 
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69 °C for 2 hr. At the end of the incubation, 3 /il was 
removed from the reaction mixture and subjected to 
electrophoresis on a 7 M urea 20% polyacrylamide gel. The 
results are shown in Figure 4. Note the absence of a band 
in lane b and lane c. 

In the preferred practice of the invention, a 
plurality of target sequences are simultaneously 
amplified. Both non-VNTR loci and VNTR loci may be 
simultaneously amplified by a single PGR reaction without 
serious compromise of the amplification efficiency of 
individual loci. 

To test whether three non-VNTR loci could be 
simultaneously amplified by PCR in a single reaction 
without an effect on amplification efficiency of 
individual loci, three well characterized single copy 
genes, 0-globin, Kirsten ras oncogene (K-ras) and growth 
hormone (hGH) , were chosen. PCR was carried out using 
purified normal human leukocyte DNA as the template. The 
primer sets used were all 19 or 20 nucleotides in length. 
Under appropriate conditions (55°C/2 min annealing, 
72°C/3 min polymerization using Tag DNA polymerase, and 
94°C/1 min denaturation, for 20 cycles) , the primer set(s) 
used single or in combination with the others, each 
directed the amplification of a single specific major DNA 
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fragment. The reaction mixture (50 jxl) contained in 
addition to the appropriate buffer, 0.5 pg template DNA, 
0.1 mM each of dATP, dCTP, dGTP, and TTP f 12.5 pmol of 
each primer and 2.5 units of Tag DNA polymerase (Cetus) . 
These amplified DNA fragment (s) were recognizable by 
ethidium bromide staining following agarose gel 
electrophoresis and did not require any hybridization 
step. Under appropriate conditions and using suitable 
primer sets, multiple, e.g. , six to ten, different DNA 
fragments may be amplified in one reaction mixture. 

A similar experiment was done with three VNTR loci 
33.6 (Jeffreys, supra ) , 33.1 (Jeffreys, supra ) and H-ras. 
Specifically, in a 50 ^1 reaction, 7.5 units Tag 
polymerase, 25 pmol of each primer and 20 cycles of PCR 
with the program 67°C/15 sec, 72°C/6 min, 94°C/1 min. It 
was possible to detect the alleles of all three loci by 
visualization with ethidium bromide staining or by 
blotting and hybridization with a mixture of probes 
specific for the three VNTR loci. 

Class 1 ASPs 

Class I ASPs are generally illustrated by Figure 5. 
The figure illustrates two oligonucleotide primers (Pi and 
P2) complementary to opposite strands of DNA flanking 
restriction site polymorphisms in a DNA target sequence. 
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Four PCR amplified target sequences 1, 2, 3 and 4 are 
shown. Only sequences 1 and 2 include the amplified 
restriction site. The presence or absence of the 
restriction site in amplification product is revealed by 
digestion with the appropriate enzyme and determining the 
size of the restriction fragments in known manner. 

Class II ASPs 

Class II ASPs are generally illustrated by Figure 6 in 
which PI and P2 are oligonucleotide primers as described 
with reference to Figure 5. The PCR amplification product 
of two DNA sequences of different length (number of 
nucleotides) is shown. The length difference reflects the 
deletion or insertion of one or more nucleotides or the 
number of tandem repeats of a VNTR. Class II ASPs based 
on the presence of VNTRs are valuable for association 
studies between specific alleles at the polymorphism 
marker locus and a disease such as the analysis of the 
association between the H-Ras tandem repeat and cancer. 

Example 1 

Primers complementary to the single copy DNA flanking 
the VNTR region were chosen for the 33.6 locus (Jeffreys, 
1985, supra ) . 
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DNA (250 ng) from 4 members of a family (father, 
mother, and two sons) were amplified in a 50 pi reaction 
containing 12.5 pmol of each primer, 200 /iM dNTPs, 2.5 
units Tag polymerase and the appropriate buffer. 
Reactions were overlaid with oil and subjected to 20 
cycles of PCR with the following conditions: 67°C/15 sec, 
72°C/6 min, 94°C/1 min. The products of the reaction were 
then subjected to electrophoresis on a 1.5% agarose gel, 
stained with ethidium bromide and visualized by 
photographing under UV light. The DNA fragments were then 
transferred to a nylon membrane (Genetran) by blotting and 
then hybridized with a probe specific for the VNTR 
region. After washing, the filter was then exposed to 
X-ray film. 

Figure 7 is a reproduction of the exposed X-Ray film. 
As the figure shows, in this highly polymorphic locus, 
three alleles are seen in this family, the father with 
alleles 1 and 2 and the mother with allele 3 
(homozygous) . The sons have inherited the alleles from 
the parents in a mendelian fashion. The example 
represents the generation of Class II ASPs for this 3 3.6 
locus. 
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Example 2 

Primers were selected in the manner described in 
Example 1. 

DNA (250 ng) from 14 unrelated individuals was 
amplified in a 50 /il reaction containing 12-5 pmol of each 
primer, 200 pm dNTPs, 2.5 units Tag polymerase and the 
appropriate buffer- Reactions were overlaid with oil and 
subjected to 20 cycles of PCR with the following 
conditions: 67°C/15 sec, 72°C/6 min, 94°C/1 min. The 
products of the reaction were then subjected to 
electrophoresis on a 1.5% agarose gel, stained with 
ethidium bromide and visualized by photographing under UV 
light. The DNA fragments were then transferred to a nylon 
membrane (Genetran) by blotting and then hybridized with a 
probe specific for the VNTR region. After washing, the 
filter was scanned on an Ambis Mark II radioactive 
scanner. The results are shown by Figure 8. As the 
figure shows, of these 14 individuals, only one appears to 
be homozygous for the highly polymorphic locus. This 
example represents the generation of class II ASPs for 
this locus. 
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In like manner, the loci for (i) H-ras VNTR, 
(ii) insulin gene. VNTR, (iii) 33.1 (Jeffreys, supra ) , 
(iv) 33.4 (Jeffreys, supra ) , (v) 3'a-globin VNTR, and 
(vi) ^r-globin VNTR have been converted into Class II ASPs. 

Class III ASPs 

Class III ASP generation is typified by Figure 9 in 
which PI and P2 are primer oligonucleotides as described 
with reference to Figure 5. Four amplified DNA sequences 
which include 2 alleles are shown. Hybridization with 
allele specific oligonucleotide (ASO) probes 
simultaneously reveals the location of the allele 
including the preselected locus specific characteristic as 
shown by Figure 9. Figure 9 shows one allele that 
hybridizes ( — ) to an oligonucleotide specific for allele 
1 and does not hybridize (_/u) with the oligonucleotide 
specific for allele 2 and a second allele that hybridizes 
in the opposite way. 

Class III ASPs can also be visualized by hybridizing a 
blot containing the fragments with a mixture of 
oligonucleotides, one specific for one allele of each 
relevant locus. For example, a labelled oligonucleotide 
probe specific for allele 1 may be mixed with a ten- fold 
molar excess of unlabeled probe specific for allele 2. 
Competition hybridization as described in parent 
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application Serial No. 187,428 with such a mixed probe 
assures absolute discrimination between the two 
alleles.il/ After one allele for each locus has been 
visualized, the probe can be removed and the blot 
rehybridized with a probe mixture to the other alleles. 

Example 3 

Oligonucleotides having the sequences shown in 
Table II were synthesized and purified as described 
previously. 

18/ 

TABLE II 

Oliqo Genea Sequence (5 / ->3 / ) 

H/319A pK CTCCTGAGGAGAAGTCTGC 

H/319S ^S CTCCTGTGGAGAAGTCTGC 

a The oligos are perfectly complementary either to the 

normal human /3-globin gene (/i A ) or to the sickle cell 

0-globin gene (0 s ) . 



17/ see Nozari, G., et al., Discrimination among the 
transcripts of the allelic human /9-globin genes 0 A , and 
p c using oligodeoxynucleotide hybridization probes, Gene 

43 :23-28 (1986) . 

18/ Wu and Wallace, The ligation amplification reaction 
Tlar) -amplification of specific DNA sequences using rounds 
of template dependent ligation, Genomics , in press (1988). 
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Normal (£ A , j9 A ) , sickle cell disease (0 s , p s ) , and 
sickle cell trait (0 A , 0 s ) genomic DNA samples were 
prepared as described by Wu and Wallace, supra , n. 18. 

thalassemia major DNA was prepared from EBV transformed 
lymphocytes in culture (GM2267 cells from NIGMS Human 
Genetic Mutant Cell Repository, Camden N.J.). The cells 
were derived from a patient with a homozygous deletion of 
the and /9-globin genes. Both plasmid and genomic DNA 
samples were treated with restriction enzymes (Bam HI 
and/ or Tag I) and Exo III nuclease prior to serving as 
templates in ligation reactions. 

PCR primers , BGP1 and BGP2 , are 19 nucleotide long 
synthetic oligonucleotides which anneal to opposite 
strands of the £-globin gene at positions 256 nucleotides 
apart (Chehab et al., 1987, supra) . 1 pq of genomic DNA 
samples are routinely used as template in these 
reactions. Amplification reactions are performed 
accordingly with the Polymerase Chain Reaction Kit 
(Perkin-Elmer-Cetus) . including 2.5 unit of Tag DNA 
polymerase and 2.5 of oligonucleotide primers. 30 
rounds of DNA amplification gave approximately 5 X10 5 fold 
amplification. A 10 /il aliquot of the reaction mixture 
was subsequently subjected to electrophoresis on a* 1.5% 
agarose gel at 60 V for 5 hr and transferred to a Genetran 
nylon membrane using 20 X SSC according to Southern 
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(Southern, 1975) • 10 pi aliquots of the reaction mixture 
are used as template for LAR. 

The Genetran membrane was hybridized to labeled 
oligonucleotide probes, H019S and H019A, in 5X SSPE (IX 
SSPE= 

10 mM sodium phosphate -pH 7.0, 0.18 M NaCI, and 1 mM 
EDTA) , 1% NaDodS04, 10 Atg/ml of Homomix RNA, and 10 6 
cpm/ml of labeled oligonucleotide with 10 fold excess 
unlabeled competitor at 47'C for 2 hr (Nozari et al. r 
1986, supra ) . The membrane was first washed in room 
temperature with 6X SSC three times for 30 min (IX SSC = 
150 mM NaCI and 15 mM sodium citrate) . A subsequent wash 
in TMAC1 solution for 1 hr at 59 °C removed all detectable 
mismatch hybridization. (TMACl = 50 mM Tris, pH 8.0, 3 M 
tetramethylammonium chloride, 2 mM EDTA f 0.25% SDS) . The 
results are shown in Figure 10. 

Class III ASPs of particular interest include the 
common mutations that cause CpG dimers to mutate into TpG 
or CpA dimers. Class III ASPs are also especially 
valuable for cloned genes which have not been shown to 
have RFLPs . If two oligonucleotides, one that includes 
the wild-type sequence and one that includes the altered 
sequence, are synthesized, then all of the three possible 
genotypes can be analyzed unambiguously. Homozygotes for 
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the presence of one allele will hybridize with the first 
oligonucleotide; homozygotes for presence of the second 
allele will hybridize with the second oligonuclieotide; 
heterozygotes will hybridize with both. 

In general, high molecular weight genomic DNA, 
purified by phenol/chloroform extraction of freshly 
prepared leukocyte nuclei from newly collected heparinized 
venous blood, is used as the template for primer initiated 
amplification of specific target sequences by PCR. 

One aspect of the invention involves the discovery 
that the partially purified DNA (for example, by glass 
powder binding method) 12/ from stored nuclei frozen in 
buffer or in ethanol at about -20 °C works well as a 
template in PCR, compared to the highly purified DNA from 
freshly prepared nuclei isolated from the same blood 
specimen. Partially purified DNA prepared by other 
methods (e.g. digestion of stored nuclei with 0.4M KOH at 
80 °C for 10 min followed by neutralization with 2N 
perchloric acid at 0.-4°C)20/ failed to amplify specific 
sequence(s) initiated by the primer set(s) . 

19/ vogelstein, B., et al., Preparation and analytical 
purification of DNA from agarose, Proc . Natl . Acad . Sci . USA 
76:615-619 (1979). 

20/ Mclntyre, P., et al., A quantitative method for 
analyzing specific DNA sequences directly from whole 
cells, Anal.Biochem. 174 :209-214 (1988). 
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Figure 11 shows one aspect of the gene mapping utility 
of ASP technology. A series of i PCR amplified DNA 
target sequences is depicted. Each amplified target 
sequence has the length x x + x 2 . + C*Mi, where x x is the 
distance from the first primer to the marker M, x 2 is a 
constant distance, C is a spacing constant, and Mi is the 
marker unique to the ith locus. If a series of loci is 
being analyzed, the ith amplified target sequence 
including the Marker Mi is derived from the sequence at 
the ith locus in one individual. If a series of 
individuals is being analyzed for the same locus, the P2 
primer is different for each individual so that the ith 
amplified target sequence is derived from the sequences at 
a specific locus in the ith individual. 

For example, in a gene mapping experiment, a series of 
80 markers (Mi where i = 1/80) well spaced along the 
genome are selected. For the genotyping of individuals at 
one or more disease-causing loci, the 80 markers represent 
target sequence variations or polymorphisms at the marker 
loci. Oligonucleotide primers are synthesized and 
positioned so that each of the markers is included in a 
sequence of unique, locus specific length. For example, 
when restriction enzymes are used as with Class I "and some 
Class II ASPs, the uncut amplified target sequence lengths 
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could be xi. + X2 + C*Mi, and the cut target sequence 
lengths xi and X2 + C*Mi the locus specific sequence, 
where C is the number of bases desired between cut 
sequences. In the case of Class II ASPs which do not 
involve restriction site polymorphism and Class III ASPs, 
the series could be simply X2 + C*Mi. 

At least four, and preferably five, automation devices 
are used in ASP technology. Figure 12 illustrates one 
combination of these devices useful for the complete 
automation of genotyping and gene mapping experiments. 
Referring to Figure 12, the combination includes a 
computer 1, a sample management system 2, an 
oligonucleotide synthesis machine 3, an oligonucleotide 
amplification machine 4 , and an amplified DNA target 
sequence analyzer 5. 

So understood, the invention includes a single device 
or system capable of creating oligonucleotides, running 
amplification and enzyme reactions, and loading reaction 
products onto an automated gel eletrophoresis instrument, 
all controlled through a computer with logic for 
calculating likelihoods of models from pedigree data,^!/ 

21/ Elston, R.C. and Stewart, J., A general model for the 
genetic analysis of pedigree data, Hum , Hered . 21:523-542 
(1971); Cannings, C , et al., Probability functions on 
complex pedigrees, Adv . App 1 . Pr ob 1 . 10:26-91 (1978) . 
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for artificial intelligence capabilities, and for 
oligonucleotide design and control of gene mapping 
experiments and for conducting a search for ASPs in a 
region of interest. 

Fragment length differences can be very small and 
thousands of ASPs could be assayed simultaneously on one 
gel. Bishop and SkolnicklZ/ have shown that under many 
circumstances, gene mapping experiments can yield a set of 
likely localizations for a gene with a reasonably small 
number of observations. For mapping a rare dominant 
disease to a set of genetic markers which are well spaced 
along the genome and have two common alleles, fewer than 
6,000 data points are needed to obtain an expected LOD 
score of 3 between the disease locus and the closet 
genetic marker. Therefore one gel would suffice to 
indicate a set of the most probable map locations for a 
gene. A second gel would permit selection of the correct 
location and simultaneously provide a fine structure map. 

22/ Bishop, D.T., and Skolnick, M.H. , Genetic Markers and 
Linkage Analysis. In "Banbury Report 14: Recombinant PNA 
Applications to Human Diseases" (T. Caskey and R. White, 
Eds.), pp. 251-259, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y. (1983) and Bishop, D.T. and Skolnick, 
mTh: Numerical considerations for linkage studies using 
polymorphic DNA markers in humans. In: Banbury R e P° r 5 
No. 4: Cancer Incidence in Defined P opulations (J; Cairns, 
J.L. Lyon, M. Skolnick, eds.), New York: cold Spring 
Harbor Laboratory, pp. 421-433 (1980) . 
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If 10 PCRs were prepared in the same reaction, and 50 
reactions were prepared per run of a PCR machine, the 
machine would then have to be loaded 12 times to produce 
enough reactions to load the initial gel. If a PCR 
machine were loaded twice a day, and two PCR machines 
utilized at. a time, then a gene would be localized in one 
work-week. The experiment would proceed even faster r if 
the PCR machine were further automated and interfaced with 
an intelligent controller so that reactions occurred 
around the clock and were collected until the gel was 
ready to be loaded. 

In addition to straightforward gene mapping 
expeariments „ ASPs will be useful in a number of other 
applications. For example, hospital genotyping 
individuals for a specific disease will have rapid, 
inexpensive results. Complex genotyping for a large 
faorctly can be obtained in a single, rapid experiment. A 
saingle run through the PCR reaction machine will give a 
genotypical profile. of an individual at hundreds of loci. 
Such typing is ideal for paternity testing, forensic 
medicine , or genotyping an individual for loci determining 
important susceptibilities. For frequent 2 allele 
polymorphisms, the genotypes of two individuals selected 
at: random from a population have a probability of less 
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than 1/2 of being identical. Therefore, with 40 marker 
loci, the probability of wrongly identifying a random 
individual through his genotype is less than 1 in a 10 

A number of gene mapping problems require a high 
density of markers, e.g., the autozygosity method to map 
rare recessives, 2 .!/ the resolution of genetic 
heterogeneity by simultaneously mapping to multiple 
loci, 24/ and mapping traits that are due to interactions 
of multiple loci, rare conditions, or highly complex 
predispositions with low penetrance. Simultaneous ASPs 
allow the necessary density of markers to be achieved with 
a reasonable level of effort, thus making these approaches 
feasible. 

ASP technology opens up new possibilities for the 
study of recombination. A dense map of markers will allow 
one to identify crossovers precisely, infer chiasma 

23/ Smith, C.A.B. , The detection of linkage in human 
SSnetics. J. R. Stat. Soc. B 15:153-192 (1953); Lander, 
E.S. and Botstein, D. , Homozygosity mapping: A way to map 
human recessive traits with the DNA of inbred children, 
Science , 236:1567-1570 (1987). 

24/ Lander, E.S. and Botstein, D. , Strategies for 
studying heterogeneous genetic traits in humans by using a 
linkage map of restriction fragment length polymorphisms, 
Proc. Natl. Acad. Sci. USA 83:7353-7357 (1986). 
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distributions, and utilize the information for an analysis 
of chiasma-based interference. 25/ T h e ability to analyze 
ASPs on DNA amplified from a single sperm or diploid 
celllf/ creates new possibilities in gene mapping, 
genotyping and the analysis of recombination. If each 
gamete were treated as a haploid offspring so that 
literally thousands of meiotic products of a single 
individual could be studied, the map order of very close 
markers could be resolved and the study of recombination 
would become even more specific. 

ASP technology will also greatly facilitate 
agricultural experiments in which a gene found in one 
strain is inserted into another strain through controlled 
breeding experiments. Genetic markers are required to 
select the offspring who retain the gene crossed into the 
strain, who also retain as much of their original genetic 
compositions as possible. 



25/ Goldgar, D.E., and Fain, P.R. , A mixed model of 
genetic interference, Amer. J. Hum. Genet. 41S:A167 (1987) . 

26/ Li, H.H., et al., Amplification and analysis of DNA 
sequences in a single human sperm and diploid cells, 
Nature 335 :414-417 (1988) . 
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WE CLAIM: 

1. A method for simultaneously determining the 
genotype at a plurality of polymorphic loci in a 
polynucleotide target sequence, each of said loci having a 
preselected measurable locus specific characteristic which 
comprises : 

amplifying at least one polynucleotide target 
sequence which may or may not include such loci to 
provide an amplification product containing a 
plurality of said target sequences; and 

analyzing the amplification product to determine 
the presence or absence of target sequences which 
include said preselected locus specific characteristic. 

2. A method as defined in claim 1 in which the locus 
specific characteristic at each loci is (i) a restriction 
site, (ii) the length of a restriction fragment, (iii) a 
sequence complementary to an allele-specif ic 
oligonucleotide probe, (iv) the length of a repeat unit in 
a VNTR sequence, or (v) a labelled DNA sequence. 

3. A method as. defined by claim 1 or claim 2 in which 
the target sequence is amplified by the polymerase chain 
reaction. 

4. A method as defined by claim 1 or claim 2 in which 
the target sequence is amplified by the ligation 
amplification reaction. 



WO 89/10414 



PCT/US89/01807 



-32- 

5. A method as defined by claim 1, 2 or 3 in which 
the polynucleotide target sequence is a DNA target 
sequence. 

6. A method as defined by claim 1 in which the 
polynucleotide target sequence is a DNA target sequence 
and said sequence is amplified by the polymerase chain 
reaction. 

7. A method for determining the genotype at a 
plurality of polymorphic loci in a polynucleotide target 
sequence, which loci may or may not include a preselected 
locus specific characteristic due to (i) deletion/ 
insertion variations, (ii) variable number tandem repeats , 
(iii) base pair substitution, or (iv) small consistent 
deletions which comprises: 

amplifying polynucleotide target sequences which 
may or may not include said preselected locus specific 
characteristic to produce a product containing a 
plurality of amplified target sequences of different 
length; 

the length of amplified target sequences in said 
amplification product having been predetermined by the 
positioning of the primers utilized in the 
amplification process on the target sequence; and 

visualizing the differences in the lengths of 
target sequences in the amplification product. 
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8. A method as defined by claim 7 in which the target 
sequence is a DNA target sequence which is amplified by a 
polymerase chain reaction. 

9. A method for simultaneously determining the 
genotype at a plurality of polymorphic loci in a 
polynucleotide target sequence, which sequence may or may 
not include a preselected locus specific characteristic 
restriction site which comprises: 

amplifying a target sequence; 

digesting the amplification product with a 
restriction enzyme effective to cleave said product at 
such preselected restriction sites, if present; and 

analyzing at least some of the cleave fragments. 

10. A method as defined by claim 9 in which at least 
some of the cleavage fragments are analyzed by visualizing 
the length thereof. 

11. A method for determining the genotype of a 
plurality of polymorphic loci in a DNA target sequence, 
which sequence may or may not include a preselected 
genotypical enzyme restriction site which comprises: 

amplifying a DNA target sequence spanning the 
said loci by a polymerase chain reaction to produce an 
amplification product containing a plurality of DNA 
target sequences ; 
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digesting the amplification product with a 
restriction enzyme effective to provide fragments 
flanking said restriction sites, if present, in the 
DNA target sequences included in said amplification 
product; 

the length of said fragments having been 
determined by preselection of the number of 
nucleotides separating said restriction sites on said 
target DNA from at least one of the oligonucleotide 
primers utilized in said polymerase chain reaction; and 

visualizing the lengths of said fragments. 

12. A method for the creation of new genetic markers 
and the simultaneous determination of multiple genetic 
markers in a genome DNA sample, comprising: 

amplifying said genomic DNA sample to provide an 
amplification product containing a plurality of 
sequences which may or may not include said DNA 
sequence variation; and 

determining the presence or absence of DNA 
sequence variation in said plurality of amplification 
product sequences. 

13. The method of claim 12 in which the sequence 
variation in said genomic DNA sample comprises a cleavage 
site of a restriction enzyme. 
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14. The method of claim 12 in which the sequence 
variation in said genomic DNA sample comprises the 
deletion or insertion of a nucleotide. 

15. The method of claim 12 in which the sequence 
variation in said genomic DNA sample comprises the 
p resenC e of variable number tandem repeats (VNTRs) in said 
DNA sample. 

16. The method of claim 12 in which the sequence 
variation in said genomic DNA sample comprises base pair 
substitution polymorphism. 

17. The method of claim 12 in which the sequence 
variation in said genomic DNA sample comprises small 
consistent nucleotide deletions. 

18. The method of claim 12 in which the sequence 
variation is detected by digestion of the amplified DNA 
sample with a restriction enzyme and visualizing the 
resulting DNA fragments. 

19. A method for the simultaneous determination of 
multiple genetic markers comprising: 

amplifying. by a polymerase chain reaction a 
genomic DNA sample spanning a preselected enzyme 
restriction site; 

digesting the amplification product with a 
restriction enzyme effective to cleave said product at 
said preselected restriction site; 
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the length of said fragments produced by said 
cleavage having been predetermined by the number of 
nucleotides separating said restriction site of said 
sample from at least one of the oligonucleotide 
primers used in said polymerase chain reaction; and 

analyzing the cleavage product to determine the 
presence or absence therein of fragments of said 
predetermined length. 

20. A method as defined by claim 19 in which the 
presence or absence in said cleavage product of a fragment 
of predetermined length is determined by gel 
electrophoresis. 

21. A method as defined by claim 19 in which the 
presence or absence in said cleavage product of a fragment 
of determined length is determined by the location of said 
fragment on a gel. 

22. A method as defined by claim 19 in which the 
presence or absence of said cleavage product of a fragment 
of determined length is determined by the elapsed time 
required for a fragment to appear at a predetermined 
location on a gel. 
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23 • A method for determining the presence or absence 
in a DNA target sequence of at least one locus having a 
preselected, measurable locus specific characteristic 
flanked by first and second alleles which comprises: 

(i) subjecting said DNA target sequence to a 
polymerase chain reaction using first and second 
primer sets 

said first set of primers being 
complementary to both o-f said first and second 
alleles, 

one of said second set of primers being 
specific only for a first or second allele unique 
to said locus, 
to produce a PCR product in which the only amplified 
sequences include said preselected locus specific 
characteristic typical of said locus, and 

(ii) determining the presence or absence in said 
PCR product of sequences including said preselected 
locus specific characteristic. 

24. A method as defined by claim 23 in which step 
(ii) is accomplished by simultaneously separating the 
sequences including said preselected genus specific 
characteristic by gel electrophoresis. 
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25. A method as defined by claim 23 or claim 24 in 
which, in step (i) , a plurality of DNA target sequences 
are simultaneously subjected to a polymerase chain 
reaction using first and second primer sets designed for 
the allele specific amplification of each of the loci in 
said DNA target sequences having a preselected locus 
specific characteristic. 

26. A method as defined by claim 3 in which the 
amplification product is analyzed by 

(i) forming a primer template duplex in which 
the template is an amplified polynucleotide target 
sequence which may or may not include said locus 
specific characteristic and the primer is annealed to 
said template immediately 3' and adjacent to the 
-nucleotide involved in said locus specific 
characteristic; 

(ii) subjecting said duplex to a polymerase 
chain reaction in the presence of a labelled adNTP in 
which N is a nucleotide complementary either to the 
next nucleotide in said target sequence which includes 
said locus specific characteristic or to the next 
nucleotide in said target sequence which does not 
include said locus specific characteristic, thereby 
producing a labeled locus specific extension product 
one nucleotide longer than said primer; and 
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(iii) determining the presence or absence of 
said locus specific extension product in the 
polymerase chain reaction product of step (ii) . 

27. A method as defined by claim 26 in which said 
locus specific characteristic is an endonuclease 
restriction site. 

28. A method which comprises simultaneously 
amplifying a template DNA target sequence utilizing a 
plurality of primer sets so designed and positioned with 
respect to said template as to provide an amplification 
product including a plurality of amplified target 
sequences each of unique length and simultaneously 
separating said amplified target sequences of different 
length by gel electrophoresis. 

29. A method which comprises simultaneously 
amplifying by the polymerase chain reaction a DNA target 
sequence having a plurality of polymorphic loci each of 
said loci having a preselected, restriction site 
polymorphism flanked by unique alleles, 

utilizing in said simultaneous polymerase chain 
reaction a plurality of primer sets designed to 
provide an amplification product having flanking 
alleles of a length unique for each of said loci, 
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clcaving the polymerase chain reaction product at 
said preselected restriction sites to produce a 
plurality of alleles of a length unique to each of 
said loci, and 

determining the presence or absence of alleles 
unique to each of said loci by gel electrophoresis. 
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Hypothetical AS-PCR pattern. Indi 
vidual is heterozygous for loci 
'1,3,6,8 and 10. PI and P2 are 
primer sets specific for allele 1 
and allele 2 of the particular 
locus. 
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